
Approximation and Streaming Algorithms for Projective Clustering via Random Projections

Authors: Michael Kerber, Sharath Raghvendra
Publication date: 08/07/2014

Let $P$ be a set of $n$ points in $\mathbb{R}^d$. In the projective clustering problem, given $k, q$ and norm $\rho \in [1,\infty]$, we have to compute a set $\mathcal{F}$ of $k$ $q$-dimensional flats such that $(\sum_{p\in P}d(p, \mathcal{F})^\rho)^{1/\rho}$ is minimized; here $d(p, \mathcal{F})$ represents the (Euclidean) distance of $p$ to the closest flat in $\mathcal{F}$. We let $f_k^q(P,\rho)$ denote the minimal value and interpret $f_k^q(P,\infty)$ to be $\max_{r\in P}d(r, \mathcal{F})$. When $\rho=1,2$ and $\infty$ and $q=0$, the problem corresponds to the $k$-median, $k$-means and the $k$-center clustering problems respectively. For every $0 < \epsilon < 1$, $S\subset P$ and $\rho \ge 1$, we show that the orthogonal projection of $P$ onto a randomly chosen flat of dimension $O(((q+1)^2\log(1/\epsilon)/\epsilon^3) \log n)$ will $\epsilon$-approximate $f_1^q(S,\rho)$. This result combines the concepts of geometric coresets and subspace embeddings based on the Johnson-Lindenstrauss Lemma. As a consequence, an orthogonal projection of $P$ to an $O(((q+1)^2 \log ((q+1)/\epsilon)/\epsilon^3) \log n)$ dimensional randomly chosen subspace $\epsilon$-approximates projective clusterings for every $k$ and $\rho$ simultaneously. Note that the dimension of this subspace is independent of the number of clusters $k$. Using this dimension reduction result, we obtain new approximation and streaming algorithms for projective clustering problems. For example, given a stream of $n$ points, we show how to compute an $\epsilon$-approximate projective clustering for every $k$ and $\rho$ simultaneously using only $O((n+d)((q+1)^2\log ((q+1)/\epsilon))/\epsilon^3 \log n)$ space. Compared to standard streaming algorithms with $\Omega(kd)$ space requirement, our approach is a significant improvement when the number of input points and their dimensions are of the same order of magnitude.

Comment: Canadian Conference on Computational Geometry (CCCG 2015)

arXiv.org e-Print Archive

CiteSeerX

MPG.PuRe

Counting tropical elliptic plane curves with fixed j-invariant

Authors: Michael Kerber, Hannah Markwig
Publication date: 01/01/2009

In complex algebraic geometry, the problem of enumerating plane elliptic curves of given degree with fixed complex structure has been solved by R. Pandharipande using Gromov-Witten theory. In this article we treat the tropical analogue of this problem, the determination of the number of tropical elliptic plane curves of degree d and fixed "tropical j-invariant" interpolating an appropriate number of points in general position. We show that this number is independent of the position of the points and the value of the j-invariant and that it coincides with the number of complex elliptic curves. The result can be used to simplify Mikhalkin's algorithm to count curves via lattice paths in the case of rational plane curves.

Comment: 34 pages; minor changes to match the published version

arXiv.org e-Print Archive

Crossref

Polynomial-Sized Topological Approximations Using The Permutahedron

Authors: Aruni Choudhary, Michael Kerber, Sharath Raghvendra
Publication date: 01/01/2016

Classical methods to model topological properties of point clouds, such as the Vietoris-Rips complex, suffer from the combinatorial explosion of complex sizes. We propose a novel technique to approximate a multi-scale filtration of the Rips complex with improved bounds for size: precisely, for $n$ points in $\mathbb{R}^d$, we obtain an $O(d)$-approximation with at most $n2^{O(d \log k)}$ simplices of dimension $k$ or lower. In conjunction with dimension reduction techniques, our approach yields an $O(\mathrm{polylog} (n))$-approximation of size $n^{O(1)}$ for Rips filtrations on arbitrary metric spaces. This result stems from high-dimensional lattice geometry and exploits properties of the permutahedral lattice, a well-studied structure in discrete geometry. Building on the same geometric concept, we also present a lower bound result on the size of an approximate filtration: we construct a point set for which every $(1+\epsilon)$-approximation of the Čech filtration has to contain $n^{\Omega(\log\log n)}$ features, provided that $\epsilon <\frac{1}{\log^{1+c} n}$ for $c\in(0,1)$.

Comment: 24 pages, 1 figure
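The "combinatorial explosion" mentioned at the start of this abstract can be made concrete with a small brute-force sketch. It counts the simplices of dimension at most `max_dim` in a Vietoris-Rips complex and is not the permutahedral construction from the paper. The function name, the five sample points, and the convention that a simplex is included when all pairwise distances are at most `scale` are assumptions for illustration.

```python
from itertools import combinations
from math import comb, dist


def rips_simplex_count(points, scale, max_dim):
    """Count simplices of dimension <= max_dim in the Vietoris-Rips complex.

    Assumed convention: a vertex subset forms a simplex when all of its
    pairwise distances are <= scale. Brute force, suitable for tiny inputs only.
    """
    n = len(points)
    close = [[dist(points[i], points[j]) <= scale for j in range(n)]
             for i in range(n)]
    count = 0
    # A simplex of dimension m has m + 1 vertices.
    for size in range(1, max_dim + 2):
        for subset in combinations(range(n), size):
            if all(close[i][j] for i, j in combinations(subset, 2)):
                count += 1
    return count


# Hypothetical sample: corners of the unit square plus its center.
points = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0), (0.5, 0.5)]

small = rips_simplex_count(points, 1.0, 2)  # diagonals of the square are excluded
large = rips_simplex_count(points, 2.0, 2)  # every pair of points is within scale
full = sum(comb(len(points), j) for j in range(1, 4))
print(small, large, full)
```

Once the scale exceeds the diameter of the point set, every subset of at most `max_dim + 1` points is a simplex, so the count equals a sum of binomial coefficients and grows like $n^{k+1}$ for fixed $k$. This is the growth that size-bounded approximations such as the one in the abstract aim to avoid.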

arXiv.org e-Print Archive

Dagstuhl Research Online Publication Server

MPG.PuRe